SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Band, Vimla 

(ii) TITLE OF INVENTION: NES-1 POLYPEPTIDES, DNA, AND RELATED 

MOLECULES AND METHODS 

(iii) NUMBER OF SEQUENCES; 11 

(iv) CORRESPONDENCE ADDRESS: 

" ' - (A) ADDRESSEE: Fish & Richardson P.C- 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA; 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/467,155 

(B) FILING DATE: G6-JUN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Clark, Paul T, 

(B) REGISTRATION NUMBER: 30,162 

(C) REFERENCE / DOCKET NUMBER: 00398/100002 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 



(2) INFORMATION FOR SEQ ID NOsl: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Arg Ala Pro His Leu His Leu Ser Ala Ala Ser Gly Ala Arg Ala 
1 5 10 15 



Ala Ala Leu Leu Pro Gin Asn Asp Thr Arg Leu Asp Pro Glu Ala Tyr 
35 40 " 45 

Gly Ala Pro Cys Ala Arg Gly Ser Gin Pro Trp Gin Val Ser Leu Phe 
50 55 60 

Asn Gly Leu Ser Phe His Cys Ala Gly Val Leu Val Asp Gin Ser Trp 
65 70 75 80 

val Leu Thr Ala Ala His Cys Gly Asn Lys Pro Leu Trp Ala Arg Val 
85 90 95 

Gly Asp Asp His Leu Leu Leu Leu Gin Gly Glu Gin Leu Arg Arg Thr 
. : ' 100 105 * 110 

Thr Arg Ser Val Val His Pro Lys Tyr His Gin Gly Ser Gly Pro He 
115 120 125 

Leu Pro Arg Arg Thr Asp Glu His Asp Leu Met Leu Leu Lys Leu Ala 
130 135 140 

Arg Pro Val Val Pro Gly Pro Arg Val Arg Ala Leu Gin Leu Pro Tyr 
145 150 • 155 160 

Arg Cys Ala Gin Pro Gly Asp Gin Cys Gin Val Ala Gly Trp Gly Thr 
165 170 175 

Thr Ala Ala Arg Arg Val Lys Tyr Asn Lys Gly Leu Thr Cys Ser Ser 
180 185 190 

He Thr He Leu Ser Pro Lys Glu Cys Glu Val Phe Tyr Pro Gly Val 
195 200 205 

Val Thr Asn Asn Met He Cys Ala Gly Leu Asp Arg Gly Gin Asp Pro 
210 215 220 

Cys Gin Ser Asp Ser Gly Gly Pro Leu Val Cys Asp Glu Thr Leu Gin 
225 230 235 240 

Gly He Leu Ser Trp Gly Val Tyr Pro Cys Gly Ser Ala Gin His Pro 
245 250 255 

Ala Val Tyr Thr Gin He Cys Lys Tyr Met Ser Trp He Asn Lys Val 
260 265 270 

He Arg Ser Asn 
275 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ACCAGCGGCA GACCACAGGC AGGGCAGAGG CACGTCTGGG TCCCCTCCCT CCTTCCTATC 



GGCGACTCCC AGATCCTGGC CATGAGAGCT CCGCACCTCC ACCTCTCCGC CGCCTCTGGC 120 

GCCCGGGCTC TGGCGAAGCT GCTGCCGCTG CTGATGGCGC AACTCTGGGC CGGAGAGGCG 180 

GCGCTGCTCC CCCAAAACGA CACGCGCTTG GACCCCGAAG CCTATGGCGC CCCGTGCGCG 240 

CGCGGCTCGC AGCCCTGGCA GGTCTCGCTC TTCAACGGCC TCTCGTTCCA CTGCGCGGGT 300 

GTCCTGGTGG ACCAGAGTTG GGTGCTGACG GCCGCGCACT GCGGAAACAA GCCACTGTGG 360 

GCTCGAGTAG GGGATGATCA CCTGCTGCTT CTTCAGGGCG AGCAGCTCCG CCGGACGACT 420 

CGCTCTGTTG TCCATCCCAA GTACCACCAG GGCTCAGGCC CCATCCTGCC AAGGCGAACG 480 

GATGAGCACG ATCTCATGTT GCTAAAGCTG GCCAGGCCCG TAGTGCCGGG GCCCCGCGTC 540 

CGGGCCCTGC AGCTTCCCTA CCGCTGTGCT CAGCCCGGAG ACCAGTGCCA GGTTGCTGGC 600 

TGGGGCACCA CGGCCGCCCG GAGAGTGAAG TACAACAAGG GCCTGACCTG CTCCAGCATC 660 

ACTATCCTGA GCCCTAAAGA GTGTGAGGTC TTCTACCCTG GCG7GGTCAC CAACAACATG 720 

ATATGTGCTG GACTGGACCG GGGCCAGGAC CCTTGCCAGA GTGACTCTGG AGGCCCCCTG 780 

GTCTGTGACG AGACCCTCCA AGGCATCCTC TCGTGGGGTG TTTACCCCTG TGGCTCTGCC 840 

CAGCATCCAG CTGTCTACAC CCAGATCTGC AAATACATGT CCTGGATCAA TAAAGTCATA 900 

CGCTCCAACT GATCCAGATG CTACGCTCCA GCTGATCCAG ATGTTATGCT CCTGCTGATC 960 

CAGATGCCCA GAGGCTCCAT CGTCCAXCCT CTTCCTCCCC AGTCGGCTGA ACTCTCCCCT 1020 

TGTCTGCACT GTTCAAACCT CTGCCGCCCT CCACACCTCT AAACATCTCC CCTCTCACCT 1080 

CATTCCCCCA CCTATCCCCA TTCTCTGCCT GTACTGAAGC TGAAATGCAG GAAGTGGTGG 1140 

CAAAGGTTTA TTCCAGAGAA GCCAGGAAGC CGGTCATCAC CCAGCCTCTG AGAGCAGTTA 1200 

CTGGGGTCAC CCAACCTGAC TTCCTCTGCC ACTCCCCGCT GTGTGACTTT GGGCAAGCCA 1260 

AGTGCCCTCT CTGAACCTCA GTTTCCTCAT CTGCAAAATG GGAACAATGA CGTGCCTACC 1320 

TCTTAGACAT GTTGTGAGGA GACTATGATA TAACATGTGT ATGTAAATCT TCATGTGATT 1380 

GTCATGTAAG GCTTAACACA GTGGGTGGTG AGTTCTGACT AAAGGTTACC TGTTGTCGTG 1440 

AAAAAAAAAA AAAA 1454 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCGCAGATTT AGGTGACAC 



19 



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



r(xi-) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGCCTCTAAT ACGACTCAC 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear- 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TACCACTACA ATGGATG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GTTGAAGTGA ACTTGCGGGC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Ser Ala Leu Leu lie Leu Ala Leu Val Gly Ala Ala Val Ala Phe 
1 5 10 15 

Pro Val Asp Asp Asp Asp Lys lie Val Gly Gly Tyr Met Met Thr Arg 
20 25 30 

Tyr Ala Arg Thr Cys Xrg Glu Ser Ser Val Pro Tyr Gin Val Ser Leu 
35 40 45 

Asn Ala Gly Tyr His Phe Cys Gly Gly Ser Leu lie Aan Asp Gin Trp 
50 55 60 

" Val Val Ser Ala Ala His Cys Tyr Lys Tyr Arg lie Gin Val Arg Leu 
65 70 75 80 

Gly Glu His Asn Met Met Thr Arg Tyr Ala Arg He Asn Val Leu Glu 
85 90 95 

Gly Asn Glu Gin Phe Val Asp Ser Ala Lys He He Arg His Pro Asn 
100 105 110 

Tyr Asn Ser Trp Thr Leu Asp Asn Asp He Met Leu He Lys Leu Ala 
115 120 125 

Ser Pro Val Thr Leu Met Met Thr Arg Tyr Ala Arg Asn Ala Arg Val 
130 135 140 

Ala Ser Val Pro Leu Pro Ser Ser Cys Ala Pro Ala Gly Thr Gin Cys 
145 150 155 160 

Leu He ser Gly Trp Gly Asn Thr Leu Ser Asn Gly Val Asn Asn Pro 
165 170 175 

Asp Leu Leu Gin Cys Val Asp Ala Pro Val Leu Pro Gin Ala Met Met 
180 185 190 

Thr Arg Tyr Ala Arg Asp Cys Glu Ala Ser Tyr Pro Gly Asp He Thr 
195 200 205 

Asn Asn Met He Cys Val Gly Phe Leu Glu Gly Gly Lys Asp Ser Cys 
210 215 220 

Gin Gly Asp Ser Gly Gly Pro Val Val Cys Asn Gly Glu Leu Gin Gly 
225 230 235 240 

He val Ser Trp Gly Tyr Met Met Thr Arg Tyr Ala Arg Gly Cys Ala 
245 250 255 

Gin Pro Asp Ala Pro Gly Val Tyr Thr Lys Val Cys Asn Tyr Val Asp 
260 265 270 

Trp He Gin Asn Thr He Ala Asp Asn 
275 280 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Leu Glu Leu His Pro^Lfcu Leu Gly Gly Arg Thr Trp Arg Ala Ala Arg 
15 10 15 

Asp Ala Asp Gly Cys Glu Ala Leu Gly Thr Val Ala Val Pro Phe Asp 
20 25 30 

\ Asp Asp Asp Lys He Val Gly Gly Tyr His Ser Thr Arg Tyr He Val 
35 40 45 

Aax Thr Cys Glu Asn Ser Leu Pro Tyr Gin val ser Leu Aan Ser Gly 
50 55 60 

Ser His Phe Cys Gly Gly Ser Leu He Ser Glu Gin Trp Val Val Ser 
65 70 75 80 

Ala Ala His Cys Tyr Lys Thr Arg lie Gin Val Arg Leu Gly Glu His 
85 90 95 

Asn His Ser Thr Arg Tyr He Val Asx He Lys Val Leu Glu Gly Asn 
100 105 110 

Glu Gin Phe He Asn Ala Ala Lys He He Arg His Pro Lys Tyr Asn 
115 120 12S 

Arg Asp Thr Leu Asp Asn Asp He Met Leu lie Lys Leu Ser Ser Pro 
130 135 140 

Ala Val He His Ser Thr Arg Tyr He Val Asx Asn Ala Arg Val Ser 
145 150 155 160 

Thr He Ser Leu Pro Thr Ala Pro Pro Ala Ala Gly Thr Glu Cys Leu 
165 170 * 175 

He Ser Gly Trp Gly Asn Thr Leu Ser Phe Gly Ala Asp Tyr Pro Asp 
180 185 190 

Glu Leu Lys Cys Leu Asp Ala Pro Val Leu Thr Gin Ala His ser Thr 
195 200 205 

Arg Tyr He Val Aax Glu Cys Lys Ala Ser Tyr Pro Gly Lys He Thr 
210 215 220 

Asn Ser Met Phe Cys Val Gly Phe Leu Glu Gly Gly Lys Asp Ser Cys 
225 230 235 240 

Gin Arg Asp Ser Gly Gly Pro Val Val Cys Asn Gly Gin Leu Gin Gly 
245 250 255 

Val Val Ser Trp Gly His His Ser Thr Arg Tyr He Val Asx Gly Cys 
260 265 270 

Ala Trp Lys Asn Arg Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val 
275 280 285 

Asp Trp He Lys Asp Thr He Ala Ala Asn Ser 
290 295 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



.(xil SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Lys lie Cys He Phe Phe Thr Leu Leu Gly Thr Val Ala Ala Phe 
1 5 10 IS 

Pro Thr Glu Asp Asn Asp Asp Arg He Val Gly Gly Tyr Arg Asn Thr 
20 25 30 

Arg Tyr Pro Val Asx Thr Cys Gin Glu His Ser Val Pro Tyr Gin Val 
35 40 45 

Ser Leu Asn Ala Gly Ser His He Cys Gly Gly Ser Leu He Thr Asp 
50 55 60 

Gin Trp Val Leu Ser Ala Ala His Cys Tyr His Pro Gin Leu Gin Val 
65 70 75 80 

Arg Leu Gly Glu His Asn Arg Asn Thr Arg Tyr Pro Val Asx He Tyr 
85 90 95 

Glu He Glu Gly Ala Glu Gin Phe He Asp Ala Ala Lys Met He Leu 
100 105 110 

His Pro Asp Tyr Asp Lys Trp Thr Val Asp Asn Asp He Met Leu He 
115 120 125 

Lys Leu Lys Ser Pro Ala Thr Leu Arg Asn Thr Arg Tyr Pro Val Asx 
130 135 140 

Asn Ser Lys Val Ser Thr He Pro Leu Pro Gin Tyr Cys Pro Thr Ala 
145 150 155 160 

Gly Thr Glu Cys Leu Val Ser Gly Trp Gly Val Leu Lys Phe Gly Phe 
165 170 175 

Glu Ser Pro Ser Val Leu Gin Cys Leu Asp Ala Pro Val Leu Ser Asp 
180 185 190 

Ser Arg Asn Thr Arg Tyr Pro Val Asx Val Cys His Lys Ala Tyr Pro 
195 200 205 

Arg Gin He Thr Asn Asn Met Phe Cys Leu Gly Phe Leu Glu Gly Gly 
210 215 220 

Lys Asp Ser Cys Gin Tyr Asp Ser Gly Gly Pro Val Val Cys Asn Gly 
225 230 235 240 

Glu Val Gin Gly He Val Ser Trp Gly Asp Arg Asn Thr Arg Tyr Pro 
245 250 255 



Val Asx Gly Cys Ala Leu Glu Gly Lys Pro Gly Val Tyr Thr Lys Val 
260 265 270 

Cys Aan Tyr Lau Asn Trp He Gin Gin Thr Val Ala Ala Aan 
275 230 285 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS : not relevant: 

(D) TOPOLOGY: linear 

^(li> MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Ala Phe Ala Thr Glu Asp Asp Lys lie Val Gly Gly Tyr Ser Ser 
1 5 '10 15 

Thr Arg Tyr Pro He He Glu Cys Lys Ala Tyr Ser Gin Pro His Gin 
20 25 30 

Val Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Val Asn 
35 40 45 

Glu Asn Trp Val Val Ser Ala Ala His cys Tyr Gin Ser Arg Val Glu 
50 55 60 

Val Arg Leu Gly Glu His Asn Ser Ser Thr Arg Tyr Pro He He He 
65 70 75 80 

Gin Val Thr Glu Gly ser Glu Gin Phe He ser Ser Ser Arg Val He 
85 90 95 

Arg His Pro Asn Tyr ser Ser Tyr Asn He Asp Asn Asp He Met Leu 
100 105 110 

He Lys Leu Ser Lys Pro Ala Thr Leu Ser Ser Thr Arg Tyr Pro He 
115 120 125 

He Asn Thr Tyr Val Gin Pro Val Ala Leu Pro Thr Ser Cys Ala Pro 
130 135 140 

Ala Gly Thr Met Cys Thr Val Ser Gly Trp Gly Asn Thr Met Ser Ser 
145 150 155 160 

Thr Ala Asp Lys Asn Lys Leu Gin Cys Leu Asn He Pro He Leu Ser 
165 170 175 

Tyr Ser Ser Ser Thr Arg Tyr Pro He He Asp Cys Asn Asn Ser Tyr 
180 185 190 

Pro Gly Met He Thr Asn Ala Met Phe Cys Ala Gly Tyr Leu Glu Gly 
195 200 205 

Gly Lys Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Val Val Cys Asn 
210 215 220 



Gly Glu Leu Gin Gly Val Val Ser Trp Gly Tyr Ser Ser Thr Arg Tyr 
225 230 235 240 

Pro lie lie Gly cys Ala Glu Pro Gly Aan Pro Gly Val Tyr Ala Lys 
245 250 255 

Val Cys lie Phe Asn Asp Trp Leu Thr Ser Thr Met Ala Thr Tyr 
260 ' - 265 270 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 amino acids 
* " • - (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Arg Ala Pro His Leu His Leu Ser Ala Ala Ser Gly Ala Arg Ala 
15 10 15 

Leu Ala Lys Leu Leu Pro Leu Leu Met Ala Gin Leu Trp Ala Ala Glu 
20 25 30 

Ala Ala Leu Leu Pro Gin Asn Asp Thr Arg Leu Asp Pro Glu Ala Tyr 
35 40 45 

Gly Ala Asn Glu Ser Pro Cys Ala Arg Gly Ser Gin Pro Trp Gin Val 
50 55 60 

Ser Leu Phe Asn Gly Leu Ser Phe His Cys Ala Gly Val Leu Val Asp 
65 70 75 80 

Gin Ser Trp Val Leu Thr Ala Ala His Cys Gly Asn Lys Pro Leu Trp 
85 90 ~ 95 

Ala Arg Val Gly Asp Asp His Asn Glu Ser Leu Leu Leu Leu Gin Gly 
100 105 110 

Glu Gin Leu Arg Arg Thr Thr Arg Ser Val Val His Pro Lys Tyr His 
115 120 125 

Gin Gly Ser Gly Pro lie Leu Pro Arg Arg Thr Asp Glu His Asp Leu 
130 135 140 

Met Leu Leu Lys Leu Ala Arg Pro Val Val Pro Asn Glu Ser Gly Pro 
145 150 155 160 

Arg Val Arg Ala Leu Gin Leu Pro Tyr Arg Cys Ala Gin Pro Gly Asp 
165 170 175 

Gin Cys Gin Val Ala Gly Trp Gly Thr Thr Ala Ala Arg Arg Val Lys 
180 185 190 

Tyr Asn Lys Gly Leu Thr cys ser ser lie Thr lie Leu Ser Pro Lys 
195 200 205 



Asn Glu Ser Glu Cys Glu Val Phe Tyr Pro Gly Val Val Thr Asn Asn 
210 215 220 

Met lie Cys Ala Gly Leu Asp Arg Gly Gin Asp Pro Cys Gin Ser Asp 
225 230 235 240 

Ser Gly Gly Pro Leu Val Cys Asp Glu Thr Leu Gin Gly lie Leu Ser 
245 • 250 255 

Trp Gly Val Tyr Asn Glu ser Pro Cys Gly Ser Ala Gin His Pro Ala 
260 265 270 

Val Tyr Thr Gin lie Cys Lys Tyr Met Ser Trp lie Asn Lys Val lie 
' - - - 275 280 285 

Arg Ser Asn 
290 



